Evaluation of Glottal Epoch Detection Algorithms on Different Voice Types
Abstract
According to the source-filter model of speech production, speech can be modeled as an excitation signal passed through the vocal tract filter. The epoch, or instant of maximum excitation, corresponds to the glottal closure instant. Several speech processing applications require robust epoch detection, but this can be a difficult task. Although state-of-the-art epoch estimation methods can produce reliable results, they are generally evaluated using speech recorded with a neutral voice quality (modal voice). This paper reviews and evaluates six popular algorithms for detecting glottal closure instants on speech spoken with modal voice and seven additional voice qualities. Results show that the performance of each method is affected by the voice type and that some methods perform better than others for each voice quality.
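The source-filter view described in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the impulse-train excitation, the single two-pole resonator standing in for the vocal tract, and all names and parameter values (`impulse_train`, `resonator`, `all_pole_filter`, 8 kHz sampling, a 100 Hz pitch, a 500 Hz formant) are assumptions chosen for illustration. Each impulse position plays the role of a known glottal closure instant, which is the kind of ground truth an epoch detector would be scored against.

```python
import math

def impulse_train(n_samples, period):
    """Idealized excitation: one unit impulse per assumed glottal closure instant."""
    excitation = [0.0] * n_samples
    epochs = list(range(0, n_samples, period))
    for n in epochs:
        excitation[n] = 1.0
    return excitation, epochs

def resonator(freq_hz, bandwidth_hz, fs):
    """Denominator coefficients [a1, a2] of a two-pole resonator (hypothetical vocal tract)."""
    r = math.exp(-math.pi * bandwidth_hz / fs)   # pole radius, < 1 so the filter is stable
    theta = 2.0 * math.pi * freq_hz / fs         # pole angle
    return [-2.0 * r * math.cos(theta), r * r]

def all_pole_filter(x, a):
    """Compute y[n] = x[n] - sum over k = 1..p of a[k-1] * y[n-k]."""
    y = []
    for n, xn in enumerate(x):
        acc = xn
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                acc -= ak * y[n - k]
        y.append(acc)
    return y

fs = 8000                                                      # assumed sampling rate in Hz
excitation, epochs = impulse_train(n_samples=400, period=80)   # 80 samples = 100 Hz pitch
coeffs = resonator(freq_hz=500.0, bandwidth_hz=100.0, fs=fs)
speech = all_pole_filter(excitation, coeffs)                   # synthetic "speech" signal
```

In this toy signal the epochs are known exactly, so a detector's output could be compared with `epochs` directly. Real voice qualities such as those studied in the paper depart from this idealized excitation, which is one reason detection becomes harder.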
Similar resources
Steady Flow Through Modeled Glottal Constriction
The airflow in the modeled glottal constriction was simulated by the solutions of the Navier-Stokes equations for laminar flow, and the corresponding Reynolds equations for turbulent flow in generalized, nonorthogonal coordinates using a numerical method. A two-dimensional model of laryngeal flow is considered and aerodynamic properties are calculated for both laminar and turbulent steady flows...
A quantitative comparison of glottal closure instant estimation algorithms on a large variety of singing sounds
Glottal closure instant (GCI) estimation is a well-studied topic that plays a critical role in several speech processing applications. Many GCI estimation algorithms have been proposed in the literature and shown to provide excellent results on the speech signal. Nonetheless the efficiency of these algorithms for the analysis of the singing voice is still unknown. The goal of this paper is to a...
Epoch-based analysis of speech signals
Speech analysis is traditionally performed using short-time analysis to extract features in time and frequency domains. The window size for the analysis is fixed somewhat arbitrarily, mainly to account for the time varying vocal tract system during production. However, speech in its primary mode of excitation is produced due to impulse-like excitation in each glottal cycle. Anchoring the speech...
Automatic detection of creaky voice using epoch parameters
This paper proposes a method based on epoch parameters for detection of creaky voice in speech signal. The epoch parameters characterizing the source of excitation considered in this work are number of epochs in a frame, strength of excitation of epochs and epoch intervals. Analysis of epoch parameters estimated from zero-frequency filtering method with different window sizes is carried out. Di...
Zero Frequency Filter Based Analysis of Voice Disorders
Pitch period and amplitude perturbations are widely used parameters to discriminate normal and voice disorder speech. Instantaneous pitch period and amplitude of glottal vibrations directly from the speech waveform may not give an accurate estimation of jitter and shimmer. In this paper, the significance of epochs (glottal closure instants) and strength of excitation (SoE) derived from the zero...